Partitioning Algorithms for the Computation of Average Iceberg Queries
نویسندگان
چکیده
Iceberg queries are to compute aggregate functions over an attribute (or set of attributes) to nd aggregate values above some speci ed threshold. It's di cult to execute these queries because the number of unique data is greater than the number of counter buckets in memory. However, previous research has the limitation that average functions were out of consideration among aggregate functions. So, in order to compute average iceberg queries e ciently we introduce the theorem to select candidates by means of partitioning, and propose POP algorithm based on it. The characteristics of this algorithm are to partition a relation logically and to postpone partitioning to use memory e ciently until all buckets are occupied with candidates. Experiments show that proposed algorithm is a ected by memory size, data order, and the distribution of data set.
منابع مشابه
Partitioning based algorithms for approximate and exact Iceberg Queries
In many applications it is necessary to identify items which occur frequently within the data set which may be a materialized or non materialized relation Such queries were recently denoted as iceberg queries Several algorithms for computing iceberg queries were presented including an approximation algorithm based on concise sampling and an exact algorithm based on sampling combined with multip...
متن کاملFr{'e}chet and Hausdorff Queries on $x$-Monotone Trajectories
vspace{0.2cm}In this paper, we design a data structure for the following problem. Let $pi$ be an $x$-monotone trajectory with $n$ vertices in the plane and $epsilon >0$. We show how to preprocess $pi$ and $epsilon$ into a data structure such that for any horizontal query segment $Q$ in the plane, one can quickly determine the minimal continuous fraction of $pi$ whose Fr{'e}chet and Hausdo...
متن کاملComputing Iceberg Queries Eeciently
Many applications compute aggregate functions over an attribute (or set of attributes) to nd aggregate values above some speci ed threshold. We call such queries iceberg queries, because the number of abovethreshold results is often very small (the tip of an iceberg), relative to the large amount of input data (the iceberg). Such iceberg queries are common in many applications, including data w...
متن کاملComputing Iceberg Queries Efficiently
Many applications compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. We call such queries iceberg queries, because the number of abovethreshold results is often very small (the tip of an iceberg), relative to the large amount of input data (the iceberg). Such iceberg queries are common in many applications, including dat...
متن کاملبهبود الگوریتم انتخاب دید در پایگاه داده تحلیلی با استفاده از یافتن پرس وجوهای پرتکرار
A data warehouse is a source for storing historical data to support decision making. Usually analytic queries take much time. To solve response time problem it should be materialized some views to answer all queries in minimum response time. There are many solutions for view selection problems. The most appropriate solution for view selection is materializing frequent queries. Previously posed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000